Short-Text Semantic Similarity (STSS): Techniques, Challenges and Future Perspectives
نویسندگان
چکیده
In natural language processing, short-text semantic similarity (STSS) is a very prominent field. It has significant impact on broad range of applications, such as question–answering systems, information retrieval, entity recognition, text analytics, sentiment classification, and so on. Despite their widespread use, many traditional machine learning techniques are incapable identifying the semantics short text. Traditional methods based ontologies, knowledge graphs, corpus-based methods. The performance these influenced by manually defined rules. Applying measures still difficult, since it poses various challenges. existing literature, most recent advances in research not included. This study presents systematic literature review (SLR) with aim to (i) explain sentence barriers similarity, (ii) identify appropriate standard deep for text, (iii) classify models that produce high-level contextual information, (iv) determine datasets only intended (v) highlight challenges proposed future improvements. To best our knowledge, we have provided an in-depth, comprehensive, trends, which will assist researchers reuse enhance information.
منابع مشابه
Benchmarking short text semantic similarity
Short Text Semantic Similarity measurement is a new and rapidly growing field of research. “Short texts” are typically sentence length but are not required to be grammatically correct. There is great potential for applying these measures in fields such as Information Retrieval, Dialogue Management and Question Answering. A dataset of 65 sentence pairs, with similarity ratings, produced in 2006 ...
متن کاملText-to-Text Semantic Similarity for Automatic Short Answer Grading
In this paper, we explore unsupervised techniques for the task of automatic short answer grading. We compare a number of knowledge-based and corpus-based measures of text similarity, evaluate the effect of domain and size on the corpus-based measures, and also introduce a novel technique to improve the performance of the system by integrating automatic feedback from the student answers. Overall...
متن کاملDown Syndrome: Current Status, Challenges and Future Perspectives
Down syndrome (DS) is a birth defect with huge medical and social costs, caused by trisomy of whole or part of chromosome 21. It is the most prevalent genetic disease worldwide and the common genetic cause of intellectual disabilities appearing in about 1 in 400-1500 newborns. Although the syndrome had been described thousands of years before, it was named after John Langdon Down who described ...
متن کاملA Comparative Study of Two Short Text Semantic Similarity Measures
This paper describes a comparative study of STASIS and LSA. These measures of semantic similarity can be applied to short texts for use in Conversational Agents (CAs). CAs are computer programs that interact with humans through natural language dialogue. Business organizations have spent large sums of money in recent years developing them for online customer selfservice, but achievements have b...
متن کاملECNUCS: Measuring Short Text Semantic Equivalence Using Multiple Similarity Measurements
This paper reports our submissions to the Semantic Textual Similarity (STS) task in ∗SEM Shared Task 2013. We submitted three Support Vector Regression (SVR) systems in core task, using 6 types of similarity measures, i.e., string similarity, number similarity, knowledge-based similarity, corpus-based similarity, syntactic dependency similarity and machine translation similarity. Our third syst...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13063911